
SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 



O'Malley, Bert W.
Tsai, Ming-Jer 
Ledebur, Harry C. Jr.
Kittle, Joseph B. Jr.



(ii) TITLE OF INVENTION: MODIFIED STEROID HORMONES FOR GENE THERAPY AND METHODS FOR THEIR USE



(iii) NUMBER OF SEQUENCES: 14



(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Lyon & Lyon

(B) STREET: 633 West Fifth Street, Suite 4700

(C) CITY: Los Angeles

(D) STATE: California

(E) COUNTRY: U.S.A.

(F) ZIP: 90071-2066



(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.5" Diskette, 1.44 Mb storage

(B) COMPUTER: IBM Compatible

(C) OPERATING SYSTEM: IBM P.C. DOS 5.0

(D) SOFTWARE: Word Perfect 5.1



(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/959,013

(B) FILING DATE: October 28, 1997

(C) CLASSIFICATION:



(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 



(A) NAME: Warburg, Richard J. 

(B) REGISTRATION NUMBER: 32,327 

(C) REFERENCE/DOCKET NUMBER: 226/286 



(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (213) 489-1600

(B) TELEFAX: (213) 955-0440

(C) TELEX: 67-3510



(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6177 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CTAGAGTCGA CCTGCAGCCC AAGCTCTCGA GGGATCCTGA GAACTTCAGG GTGAGTTTGG 60 

GGACCCTTGA TTGTTCTTTC TTTTTCGCTA TTGTAAAATT CATGTTATAT GGAGGGGGCA 120 

AAGTTTTCAG GGTGTTGTTT AGAATGGGAA GATGTCCCTT GTATCACCAT GGACCCTCAT 180

GATAATTTTG TTTCTTTCAC TTTCTACTCT GTTGACAACC ATTGTCTCCT CTTATTTTCT 240

TTTCATTTTC TGTAACTTTT TCGTTAAACT TTAGCTTGCA TTTGTAACGA ATTTTTAAAT 300 

TCACTTTTGT TTATTTGTCA GATTGTAAGT ACTTTCTCTA ATCACTTTTT TTTCAAGGCA 360 

ATCAGGGTAT ATTATATTGT ACTTCAGCAC AGTTTTAGAG AACAATTGTT ATAATTAAAT 420 

GATAAGGTAG AATATTTCTG CATATAAATT CTGGCTGGCG TGGAAATATT CTTATTGGTA 480 

GAAACAACTA CATCCTGGTC ATCATCCTGC CTTTCTCTTT ATGGTTACAA TGATATACAC 540 

TGTTTGAGAT GAGGATAAAA TACTCTGAGT CCAAACCGGG CCCCTCTGCT AACCATGTTC 600 

ATGCCTTCTT CTTTTTCCTA CAGCTCCTGG GCAACGTGCT GGTTGTTGTG CTGTCTCATC 660 

ATTTTGGCAA AGAATTCACT CCTCAGGTGC AGGCTGCCTA TCAGAAGGTG GTGGCTGGTG 720 

TGGCCAATGC CCTGGCTCAC AAATACCACT GAGATCTTTT TCCCTCTGCC AAAAATTATG 780 

GGGACATCAT GAAGCCCCTT GAGCATCTGA CTTCTGGCTA ATAAAGGAAA TTTATTTTGA 840

TTGCAATAGT GTGTTGGAAT TTTTTGTGTC TCTCACTCGG AAGGACATAT GGGAGGGCAA 900

ATCATTTAAA ACATCAGAAT GAGTATTTGG TTTAGAGTTT GGCAACATAT GCCATATGCT 960 

GGCTGCCATG AACAAAGGTG GCTATAAAGA GGTCATCAGT ATATGAAACA GCCCCCTGCT 1020 

GTCCATTCCT TATTCCATAG AAAAGCCTTG ACTTGAGGTT AGATTTTTTT TATATTTTGT 1080 

TTTGTGTTAT TTTTTTCTTT AACATCCCTA AAATTTTCCT TACATGTTTT ACTAGCCAGA 1140 

TTTTTCCTCC TCTCCTGACT ACTCCCAGTC ATAGCTGTCC CTCTTCTCTT ATGAACTCGA 1200 

GGAGCTTTTT GCAAAAGCCT AGGCCTCCAA AAAAGCCTCC TCACTACTTC TGGAATAGCT 1260 

CAGAGGCCGA GGCGGCCTCG GCCTCTGCAT AAATAAAAAA AATTAGTCAG CCATGGGGCG 1320 

GAGAATGGGC GGAACTGGGC GGAGTTAGGG GCGGGATGGG CGGAGTTAGG GGCGGGACTA 1380 

TGGTTGCTGA CTAATTGAGA CTGCATTAAT GAATCGGCCA ACGCGCGGGG AGAGGCGGTT 1440 

TGCGTATTGG GCGCTCTTCC GCTTCCTCGC TCACTGACTC GCTGCGCTCG GTCGTTCGGC 1500 

TGCGGCGAGC GGTATCAGCT CACTCAAAGG CGGTAATACG GTTATCCACA GAATCAGGGG 1560 

ATAACGCAGG AAAGAACATG TGAGCAAAAG GCCAGCAAAA GGCCAGGAAC CGTAAAAAGG 1620 

CCGCGTTGCT GGCGTTTTTC CATAGGCTCC GCCCCCCTGA CGAGCATCAC AAAAATCGAC 1680 

GCTCAAGTCA GAGGTGGCGA AACCCGACAG GACTATAAAG ATACCAGGCG TTTCCCCCTG 1740 

GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA CCCTGCCGCT TACCGGATAC CTGTCCGCCT 1800 

TTCTCCCTTC GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT CTCAGTTCGG 1860 

TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG TGCACGAACC CCCCGTTCAG CCCGACCGCT 1920 

GCGCCTTATC CGGTAACTAT CGTCTTGAGT CCAACCCGGT AAGACACGAC TTATCGCCAC 1980 

TGGCAGCAGC CACTGGTAAC AGGATTAGCA GAGCGAGGTA TGTAGGCGGT GCTACAGAGT 2040 

TCTTGAAGTG GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT ATCTGCGCTC 2100 

TGCTGAAGCC AGTTACCTTC GGAAAAAGAG TTGGTAGCTC TTGATCCGGC AAACAAACCA 2160 

CCGCTGGTAG CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT TACGCGCAGA AAAAAAGGAT 2220 

CTCAAGAAGA TCCTTTGATC TTTTCTACGG GGTCTGACGC TCAGTGGAAC GAAAACTCAC 2280

GTTAAGGGAT TTTGGTCATG AGATTATCAA AAAGGATCTT CACCTAGATC CTTTTAAATT 2340

AAAAATGAAG TTTTAAATCA ATCTAAAGTA TATATGAGTA AACTTGGTCT GACAGTTACC 2400 

AATGCTTAAT CAGTGAGGCA CCTATCTCAG CGATCTGTCT ATTTCGTTCA TCCATAGTTC 2460 

CCTGACTCCC CGTCGTGTAG ATAACTACGA TACGGGAGGG CTTACCATCT GGCCCCAGTG 2520 

CTGCAATGAT ACCGCGAGAC CCACGCTCAC CGGCTCCAGA TTTATCAGCA ATAAACCAGC 2580 

CAGCCGGAAG GGCCGAGCGC AGAAGTGGTC CTGCAACTTT ATCCGCCTCC ATCCAGTCTA 2640

TTAATTGTTG CCGGGAAGCT AGAGTAAGTA GTTCGCCAGT TAATAGTTTG CGCAACGTTG 2700 

TTGCCATTGC TACAGGCATC GTGGTGTCAC GCTCGTCGTT TGGTATGGCT TCATTCAGCT 2760 

CCGGTTCCCA ACGATCAAGG CGAGTTACAT GATCCCCCAT GTTGTGCAAA AAAGCGGTTA 2820 

GCTCCTTCGG TCCTCCGATC GTTGTCAGAA GTAAGTTGGC CGCAGTGTTA TCACTCATGG 2880 

TTATGGCAGC ACTGCATAAT TCTCTTACTG TCATGCCATC CGTAAGATGC TTTTCTGTGA 2940 

CTGGTGAGTA CTCAACCAAG TCATTCTGAG AATAGTGTAT GCGGCGACCG AGTTGCTCTT 3000 

GCCCGGCGTC AATACGGGAT AATACCGCGC CACATAGCAG AACTTTAAAA GTGCTCATGA 3060 

TTGGAAAACG TTCTTCGGGG CGAAAACTCT CAAGGATCTT ACCGCTGTTG AGATCCAGTT 3120 

CGATGTAACC CACTCGTGCA CCCAACTGAT CTTCAGCATC TTTTACTTTC ACCAGCGTTT 3180 

CTGGGTGAGC AAAAACAGGA AGGCAAAATG CCGCAAAAAA GGGAATAAGG GCGACACGGA 3240

AATGTTGAAT ACTCATACTC TTCCTTTTTC AATATTATTG AAGCATTTAT CAGGGTTATT 3300 

GTCTCATGAG CGGATACATA TTTGAATGTA TTTAGAAAAA TAAACAAATA GGGGTTCCGC 3360 

GCACATTTCC CCGAAAAGTG CCACCTGACG TCTAAGAAAC CATTATTATC ATGACATTAA 3420 

CCTATAAAAA TAGGCGTATC ACGAGGCCCT TTCGTCTTCA AGCTGCCTCG CGCGTTTCGG 3480

TGATGACGGT GAAAACCTCT GACACATGCA GCTCCCGGAG ACGGTCACAG CTTGTCTGTA 3540

AGCGGATGCC GGGAGCAGAC AAGCCCGTCA GGGCGCGTCA GCGGGTGTTG GCGGGTGTCG 3600 

GGGCGCAGCC ATGACCCAGT CACGTAGCGA TAGCGGAGTT GGCTTAACTA TGCGGCATCA 3660

GAGCAGATTG TACTGAGAGT GCACCATATC GACGCTCTCC CTTATGCGAC TCCTGCATTA 3720

GGAAGCAGCC CAGTAGTAGG TTGAGGCCGT TGAGCACCGC CGCCGCAAGG AATGGTGCTG 3780 

GCTTATCGAA ATTAATCGAC TCACTATAGG GAGACCCGAA TTCGAGCTCG CCCCGTTACA 3840 

TAACTTACGG TAAATGGCCC GCCTGGCTGA CCGCCCAACG ACCCCCGCCC ATTGACGTCA 3900 

ATAATGACGT ATGTTCCCAT AGTAACGCCA ATAGGGACTT TCCATTGACG TCAATGGGTG 3960 

GAGTATTTAC GGTAAACTGC CCACTTGGCA GTACATCAAG TGTATCATAT GCCAAGTACG 4020

CCCCCTATTG ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC 4080

TTATGGGACT TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG 4140

ATGCGGTTTT GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA 4200 

AGTCTCCACC CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT 4260

CCAAAATGTC GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG 4320

GAGGTCTATA TAAGCAGAGC TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA 4380 

CGCTGTTTTG ACCTCCATAG AAGACACCGG GACCGATCCA GCCTCCGCGG GATCTTGGTG 4440

GCGTGAAACT CCCGCACCTC TTCGGCCAGC GCCTTGTAGA AGCGCGTATG GCTTCGTGGG 4500 

GATCCCCCAA AGAATCCTTA GCTCCCCCTG GTAGAGACGA AGTCCCTGGC AGTTTGCTTG 4560 

GCCAAGGGAG GGGGAGCGTA ATGGACTTTT ATAAAAGCCT GAGGGGAGGA GCTACAGTCA 4620 

AGGTTTCTGC ATCTTCGCCC TCAGTGGCTG CTGCTTCTCA GGCAGATTCC AAGCAGCAGA 4680 

GGATTCTCCT TGATTTCTCG AAAGGCTCCA CAAGCAATGT GCAGCAGCGA CAGCAGCAGC 4740 

AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCCA GGCTTATCCA 4800 

AAGCCGTTTC ACTGTCCATG GGGCTGTATA TGGGAGAGAC AGAAACAAAA GTGATGGGGA 4860 

ATGACTTGGG CTACCCACAG CAGGGCCAAC TTGGCCTTTC CTCTGGGGAA ACAGACTTTC 4920 

GGCTTCTGGA AGAAAGCATT GCAAACCTCA ATAGGTCGAC CAGCGTTCCA GAGAACCCCA 4980 

AGAGTTCAAC GTCTGCAACT GGGTGTGCTA CCCCGACAGA GAAGGAGTTT CCCAAAACTC 5040 

ACTCGGATGC ATCTTCAGAA CAGCAAAATC GAAAAAGCCA GACCGGCACC AACGGAGGCA 5100 

GTGTGAAATT GTATCCCACA GACCAAAGCA CCTTTGACCT CTTGAAGGAT TTGGAGTTTT 5160 

CCGCTGGGTC CCCAAGTAAA GACACAAACG AGAGTCCCTG GAGATCAGAT CTGTTGATAG 5220 

ATGAAAACTT GCTTTCTCCT TTGGCGGGAG AAGATGATCC ATTCCTTCTC GAAGGGAACA 5280 

CGAATGAGGA TTGTAAGCCT CTTATTTTAC CGGACACTAA ACCTAAAATT AAGGATACTG 5340 

GAGATACAAT CTTATCAAGT CCCAGCAGTG TGGCACTACC CCAAGTGAAA ACAGAAAAAG 5400

ATGATTTCAT TGAACTTTGC ACCCCCGGGG TAATTAAGCA AGAGAAACTG GGCCCAGTTT 5460

ATTGTCAGGC AAGCTTTTCT GGGACAAATA TAATTGGTAA TAAAATGTCT GCCATTTCTG 5520 

TTCATGGTGT GAGTACCTCT GGAGGACAGA TGTACCACTA TGACATGAAT ACAGCATCCC 5580 

TTTCTCAGCA GCAGGATCAG AAGCCTGTTT TTAATGTCAT TCCACCAATT CCTGTTGGTT 5640 

CTGAAAACTG GAATAGGTGC CAAGGCTCCG GAGAGGACAG CCTGACTTCC TTGGGGGCTC 5700 

TGAACTTCCC AGGCCGGTCA GTGTTTTCTA ATGGGTACTC AAGCCCTGGA ATGAGACCAG 5760 

ATGTAAGCTC TCCTCCATCC AGCTCGTCAG CAGCCACGGG ACCACCTCCC AAGCTCTGCC 5820 

TGGTGTGCTC CGATGAAGCT TCAGGATGTC ATTACGGGGT GCTGACATGT GGAAGCTGCA 5880 

AAGTATTCTT TAAAAGAGCA GTGGAAGGAC AGCACAATTA CCTTTGTGCT GGAAGAAACG 5940 

ATTGCATCAT TGATAAAATT CGAAGGAAAA ACTGCCCAGC ATGCCGCTAT CGGAAATGTC 6000 

TTCAGGCTGG AATGAACCTT GAAGCTCGAA AAACAAAGAA AAAAATCAAA GGGATTCAGC 6060 

AAGCCACTGC AGGAGTCTCA CAAGACACTT CGGAAAATCC TAACAAAACA ATAGTTCCTG 6120 

CAGCATTACC ACAGCTCACC CCTACCTTGG TGTCACTGCT GGAGGTGATT GAACCCG 6177 



(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GTACGTTTAA ACGCGGCGCG CCGTCGACCT GCAGAAGCTT ACTAGTGGTA CCCCATGGAG 60

ATCTGGATCC GAATTCACGC GTTCTAGATT AATTAAGC 98



(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGCCGCTTAA TTAATCTAGA ACGCGTGAAT TCGGATCCAG ATCTCCATGG GGTACCACTA 60 

GTAAGCTTCT GCAGGTCGAC GGCGCGCCGC GTTTAAAC 98 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GATCTCGGTC TCCAACAGCA ACAGCAACAG CAACAGCAAC AGGGTCTTCT G 51



(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GATCCAGAAG ACCCTGTTGC TGTTGCTGTT GCTGTTGCTG TTGGAGACCG A 51



(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AATTCCCCGA GGCGGCAGCT GAAATCATCA CCAATCAGAT CT 42 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TATGCCTTAC CATGTGGC 18



(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TTGGTCGACA AGATCATGCA TTATC 25



(2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
TTGTCGACCC GCAGTACAGA TGAAGTTG 28



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
TTGGTCGACC CAGCAATAAC TTCAGACATC 30



(2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
CGACAGATCT GGCTCCTGAG CAAAGAGAA 29



(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
CCAGGGATCC TCTCCTTGCT GCAA 24



(2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
TCTAGTCGAC GATGGCTCCT GAGCAAAGAG AAG 33



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCAGGGATCC TATCCTTGCT GCAACAG 27 
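
Note (illustrative, not part of the filed listing): each sequence line above gives residues in groups of ten followed by a running residue count. The Python sketch below shows one way such lines might be joined and checked against those counts. The function name `parse_sequence_lines` is hypothetical, and the sketch assumes the trailing count on every line is intact and that only the bases A, C, G and T occur.

```python
import re


def parse_sequence_lines(lines):
    """Join residue groups from listing lines and verify each running count.

    Assumes every non-empty line ends with the running residue count
    (an assumption about the layout, not a rule stated in the listing).
    """
    residues = ""
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # skip blank separator lines
        *groups, count = tokens  # last token is the running count
        residues += "".join(groups)
        if len(residues) != int(count):
            raise ValueError(
                f"running count {count} does not match {len(residues)} residues"
            )
    if re.fullmatch(r"[ACGT]+", residues) is None:
        raise ValueError("unexpected character in sequence")
    return residues


# SEQ ID NO: 3 as printed in the listing (two lines, declared length 98).
seq3 = parse_sequence_lines([
    "GGCCGCTTAA TTAATCTAGA ACGCGTGAAT TCGGATCCAG ATCTCCATGG GGTACCACTA 60",
    "",
    "GTAAGCTTCT GCAGGTCGAC GGCGCGCCGC GTTTAAAC 98",
])
print(len(seq3))  # 98
```

A check of this kind flags lines where a residue was dropped or a stray character was introduced during transcription, since either the running count or the base alphabet test fails.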